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AE015424 10029 bp DNA linear BCT 18-OCT-2002 

Shigella flexneri 2a str. 301 section 387 of 412 of the complete 
genome. 

AE015424 AE005674 
AE015424. 1 

• 

Shigel la f lexneri 2a str. 301 
Sh igel la f lexneri 2a str. 301 

Bacteria; Proteobacter ia; Gammaproteobacter i a ; Enterobacter i a I es ; 
Enterobacter i aceae; Shigella. 

1 (bases 1 to 10029) 

Jin.Q., Yuan,Z.H., Xu.J.G., Wang,Y., Shen,Y., Lu.W.C., Wang,J.H., 
Liu,H. , Yang, J., Yang, F. , Qu,D. , Zhang, X. B. , Zhang, J.Y., Yang, G. W. , 
Wu.H.T., Dong,J., Sun.LL. , Xue,Y., Zhao,A.L., Gao,Y.S., Zhu,J.P., 
Kan,B. , Chen,S.X., Yao.Z.J., He, B. K. , Chen,R.S., Ma, D. L. , 
Qiang,B.Q., Wen.Y.M. , Hou,Y.D. and Yu,J. 
Genome sequence of Shigella flexneri 2a: insights into 
pathogenicity through comparison with genomes of Escherichia coli 
K12 and 0157 

Nucleic Acids Res. 30 (20), 4432-4441 (2002) 
12384590 

2 (bases 1 to 10029) 

Jin,Q. f Shen,Y., Wang,J.H., Li u, H. , Yang,J., Yang,F., Zhang, X.B., 
Zhang, J.Y., Yang,G.W., Wu,H.T., Dong.J., Sun.L.L., Xue,Y., 
Zhao,A.L., Gao.Y.S., Zhu,J.P., Chen, S. X. , Yao.Z.J., Wang,Y., 
Lu.W.C, Qiang,B.Q., Wen,Y.M. and Hou.Y.D. 
Direct Submission 

Submitted (08-MAY-2001) Microbial Genome Center of Chinese Ministry 
of Public Health, 100 YingXin Jie, XuanWu Qu, Beijing 100052, P. R. 
China 

Locat ion/Qua) i f iers 
1.. 10029 

/organism="Shigel la flexneri 2a str. 301" 

/strain=" 301" 

/serotype= M 2a" 

/db_xref="taxon:198214" 

compl ement (86. . 1039) 

/gene=" aphA" 

/note="SF4149" 

compl ement (86. . 1039) 

/gene^aphA"' 

/no te=" Residues 81 to 317 of 317 are 100 pet identical to 
residues 1 to 237 of a 237 aa protein from Escherichia 
coli K12 ref: NP_418479.T 
/codon_star t=1 
/transl_table=11 

/product=" d i adenos i ne tet raphosphatase" 
/protein_id="AAN45571. 1" 
/db_xref="GI:24054660" 

/ 1 r an s I a t i on=" MPR I HHNKRK I NVSLHRV I N I FYTKKDRNVFTGLN I II LLF I TK 

TADNPSFPRQLALWSDGFCNKSHNKKFQHTDYLGKNMRK I TQA I SAVCLLFALNSSAV 

ALASSPSPLNPGTNVARLAEQAP IHWVSVAQIENSLAGRPPMAVGFDIDDTVLFSSPG 

FWRGKKTFSPESEDYLKNPVFWEKMNNGWDEFS I PKEVARQL I DMHVRRGDA I FFVTG 

RSPTKTETVSKTLADNFH I PATNMNPV I FAGDKPGQNTKSQWLQDKN I R I FYGDSDND 

I TAARDVGARG I R I LRASNSTYKPLPQAGAFGEEV I VNSEY" 

1162. . 1569 

/gene= M SF4150" 

1162. . 1569 

/gene="SF4150 t ' 

/note="Residues 14 to 59 of 135 are 97 pet identical to 
residues 1 to 46 of a 67 aa protein YJBS_EC0LI sp: P58036" 
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/codon_start=1 
/ trans l_tab I e=1 1 

/product=" orf , partial conserved hypothetical protein" 
/protei n_id= n AAN45572. 1" 
/db_xref= n GI :2405466f 

/trans I at i on=" MV I NF I TRDGDDDMN rSYVNSNKTTSLSVELDALNNKD I SYAKD 

FFLY I ETQLK I AKDFCRPGEEVSSS I ASK VFHAF I DLVNK I RGKKDFMY I CTLCCFAE 

EVKGDYSHYRTFLFD I GNQYKVKLTQSGKKSSL" 
gene comp I ement (1 908. .3143) 

/gene=" ty rB n 

/note="SF4151" 
CDS complement (1908. . 3143) 

/gene= n tyrB" 

/note=" Residues 15 to 411 of 411 are 98 pet identical to 
residues 1 to 397 of a 397 aa protein from Escherichia 
coli 0157:H7 EDL933 ref: NP_290687. 1" 
/codon_star t=1 
/transl_table=1 1 

/product=" tyrosine aminotransferase, tyrosine repressible" 
/protei n_id=" AAN45573. T 
/db_xref="GI :24054662" 

/trans I at i on=" MCFNHLPVNLENHRVFQKVDAYAGDP I LTLMERFKEDPRSDKVN 
LS I GLYYNEDG I I PQLKAVAEAEARLNAQPHGASLYLPMEGLNSYRHA I APLLFGADH 
PVLQQQRVAT I QTLGGSGALKVGADFLKRYFPESGVWVSDPTWENHVA I FAGAGFEVS 
TYPWYDEVTNS VRFNDLLATLKTLPARS I VLLHPCCHNPTGADLTNDQWDAV I E I LKA 
REL I PFLD I AYQGFSAGMEEDAYA I RA I ASAGLPALVSNSFSK I FSLYGERVGGLSVL 
CEDAEAAGRVLGQLKATVRRNYSSPPNFGAQ VVAAVLNDEALKASWLAEVEEMRTR I L 
AMRQELVKVLSTEMPERNFDYLLNQRGMFS YTGLSAAQVDRLREEFGVYL I ASGRMCV 
AGLNTANVQRVAKAFAAVM" 

gene complement (3354. . 4433) 

/gene= M a I r" 
/note= w SF4152 w 

CDS compl ement (3354. . 4433) 

/gene^" at r" 

/note="Residues 1 to 359 of 359 are 100 pet identical to 

residues 1 to 359 of a 359 aa protein from Escherichia 

col i K12 ref: NP_418477. 1" 

/codon_start=1 

/trans l_table=1 1 

/produc t=" a I an i ne racemase T 

/protein_id="AAN45574. T 

/db_xref= tt Gl :24054663" 

/t r ans I a t i on=" MQAATVV I NRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLE 
TARTLPDADAFGVARLEEALRLRAGG I TKPVLLLEGFFDARDLPT I SAQHFHTAVHNE 
EQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVN I VSHF 
ARADEPKCGATEKQLA I FNTFCEGKPGQRS I AASGG I LLWPQSHFDWVRPG 1 1 LYGVS 
PLEDRSTGADFGCQPVMSLTSSL I AVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGD 
GYPRAAPSGTPVLVNGREVP I VGRVAMDM I CVDLGPQAQDKAGDPV I LWGEGLPVER I 
AEMTKVSAYEL I TRLTSRVAMKYVD" 

gene comp I ement (4486. . 5901) 

/gene=" dnaB" 
/note="SF4154 M 

CDS compl ement (4486. . 5901) 

/gene="dnaB* 

/note="Residues 1 to 471 of 471 are 100 pet identical to 

residues 1 to 471 of a 471 aa protein from Escherichia 

coli K12 ref: NP_418476. 1" 

/codon_start=1 

/tran$Mable=11 

/product^" repl icat i ve DNA helicase; part of primosome' 1 
/protein_id= ,, AAN45575. 1" 
/db_xref="G! :24054664 t1 

/trans lat i on=" MAGNKPFNKQQAEPRERDPQVAGLKVPPHS I EAEQSVLGGLMLD 
NERWDDVAERVVADDFYTRPHRH I FTEMARLQESGSP I DL I TLAESLERQGQLDSVGG 
FAYLAELSKNTPSAAN I SAYAD I VRERAV VREM I SVANE I AEAGFDPQGRTSEDLLDL 
AESRVFK I AESRANKDEGPKN I ADVLDATVAR 1 EQLFQQPHDGVTGVNTGYDDLNKKT 
AGLQPSDL 1 1 VAARPSMGKTTFAMNLVENAAMLQDKPVL I FSLEMPSEQ I MMRSLASL 
SRVDQTK i RTGQLDDEDWAR I SGTMG I LLEKRN I Y I DDSSGLTPTEVRSRARR I AREH 
GG I GL I M I DYLQLMRVPALSDNRTLE I AE I SRSLKALAKELNVPVVALSQLNRSLEQR 
ADKRPVNSDLRESGS I EQDADL I MF I YRDEVYHENSDLKG I AE 1 1 I GKQRNGP IGTVR 
LTFNGQWSRFDNYAGPQYDDE" 
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gene 5876.. 6967 

/gene= n qor" 

/note= w SF4153 n 
CDS S876..6967 

/gene^qor" 

/note=" Residues 37 to 363 of 363 are 99 pet identical to 
residues 1 to 327 of a 327 aa protein from Escherichia 
coli 0157:H7 EDL933 ref: NP_290684. 1" 
/codon_star t=1 
/ 1 r an s i_t ab I e=1 1 

/produc t=" qui none ox i do reductase" 
/protein_id="AAN45576. 1" 
/db_xref="GI : 24054665" 

/translation^ MLKGLFPA I VNGVTE I KNGSKLYHMKQTLT 1 RSGGHMATR I EFH 
KHGGPEVLQAVEFTPADPAENE I QVENKA IGINFIDTY I RSGLYPPPSLPSGLGTEAA 
G I VSK VGSGVKH I KAGDHVVYAQSALGAYSS VHN I NADKAA I LPAA I SFEQAAASFLK 
GLTVYYLLRKTYE I KPDEQFLFHAAAGG I GL I ACQWAKALGAKL I GTVGTAQKAQSAL 
KAGAWQV I NYREENLVERLKE I TGGKKVRVVYDSVGRDTWERSLDCLQRRGLMVSFGN 
SSGAVTGVNLG I LNQKGSLYVTRPSLQGY I TTREELTEASNELFSL I ASGV I KVDVAE 
QQKYPLKDAQRAHE I LESRATQGSSLL I P" 

gene compl ement (71 33. . 7585) 

/gene= n yj bO" 
/note= n SF4155 M 

CDS compl ement (71 33. . 7585) 

/gene^" yj bO" 

/note=" Res idues 1 to 151 of 151 are 99 pet identical to 
residues 1 to 151 of a 151 aa protein from Escherichia 
col i K12 gi: 1790484" 
/codon_start=1 
/trans I _ t ab I e=1 1 

/product=" orf , conserved hypothetical protein" 
/protein_id= n AAN45577. 1" 
/dbjcref="GI : 24054666" 

/translation^ MKMP I KRVLTLTCWNTRSNWWR I SVNFSPKSSQ I HHALRTVAGR 
FAVKS I DYFWHDSCNASKRFH I WES I MLELLFV I GFFVMLMVTGVSLLG 1 1 AALVVAT 
A I MFLGGMLALM I KLLPWLLLA I AVVWV I KA I KAPKMPKYQRYDRWRY" 

gene complement (7509. . 8552) 

/gene=" yjblf 
/note="SF4156 n 

CDS compl ement (7509. . 8552) 

/gene= rt yj bN" 

/note="Residues 3 to 347 of 347 are 99 pet identical to 
residues 1 to 345 of a 345 aa protein from Escherichia 
col i 0157:H7 ref: NP_313058. 1" 
/codon_star t=1 
/trans l_t ab i e=l 1 

/produc t=" orf , conserved hypothetical protein" 

/protein_id="AAN45578.r 

/db_xref="GI : 24054667" 

/ 1 r ans I a t i on=" MKMHGNSEMQK I NQTSAMPEKTDVHWSGRFSVAPMLDWTDRHCR 
YFLRLLSRNTLLYTEMVTTGAI IHGKGDYLAYSEEEHPVALQLGGSDPAALAQCAKLA 
EARGYDE I NLNVGCPSDRVQNGMFGACLMGNAQLVADCVKAMRDVVS I PVTVKTR I G I 
DDQDSYEFLCDF I NTVSGKGECEMF I I HARKAWLSGLSPKENRE I PPLDYPRVYQLKR 
DFPHLTMS I NGG I KSLEEAKAHLQHMDGVMVGREAYQNPG I LAAVDRE ! FGSSDTDAD 
PVAVVRAMYPY I EHELSQGTYLGH 1 1 RHMLGLFQG I PGARQWRRYLSENAHKAGAD I N 
VLEHALKLVADKR" 

gene complement (8909. . 9616) 

/gene="yj bM" 
/note= n SF4157" 

CDS compl ement (8909. . 9616) 

/gene= n yjbM n 

/note="Residues 1 to 235 of 235 are 98 pet identical to 
residues 1 to 235 of a 235 aa protein from Escherichia 
col i K12 ref: NP„418472. V 
/codon_start=1 
/trans I tab I e=1 1 

/product=" orf , conserved hypothetical protein" 
/protein_id=" AAN45579. 1" 
/db_xref="GI :24054668" 

/translation^ MWVNKYTDDCTDEDLNDRDF I AS VVDRA I FHFA I NS I CNPGDNK 
DAMP I EQCTFDVETKNGLPSTVQLFYEESKDNEPLAN I HFQA I GSGFLTFVNACQEHD 
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gene 



CDS 



BASE COUNT 
ORIGIN 

1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 

1021 

1081 

1141 

1201 

1261 

1321 

1381 

1441 

1501 

1561 

1621 

1681 

1741 

1801 

1861 

1921 

1981 

2041 

2101 

2161 

2221 

2281 

2341 

2401 

2461 

2521 

2581 

2641 

2701 

2761 

2821 

2881 

2941 

3001 



2702 

taatcagt tt 
gagagcatca 
acgcacccgc 



DNSLKLFASLL I SLSYSSAYADLSETVY I NENNESYLKAQFEKLYQRDMKKYLGEMKR 

LADGGEMNFDGYLDKMSHLVNEGTLDPD I LSKMRDAAPQL I SFAKSFDPTSKEE I K I L 

TDTSKL I YDLFGVKSEK" 

complement (9640. .9894) 

/gene=" yjbL" 

/note= n SF4158 w 

complement (9640.. 9894) 

/gene^yjbL" 

/note= Residues 1 to 84 of 84 are 96 pet identical to 
residues 1 to 84 of a 84 aa protein from Escherichia coli 
K12 ref : NP_418471. 1" 
/codon_star t=1 
/trans i _t a b I e=1 1 

/product=" orf , conserved hypothetical protein" 
/protein_id="AAN45580. T 
/db_xref= M GI :24054669" 

/t rans I at i on=" MLK 1 1 PGATGYFNKTLNSNQFENKDATKDKLDNRGS I KGKLNN I 
YGKS IDYAALRHRDI I I AK I DLF I QR I THNLWHARKKMCF" 
a 2451 c 2275 g 2601 t 



cacgagcgcc 
ggatattttt 
agat tacegg 
ccgt t tctgt 
catgeatate 
tattcatttt 
tttttttgee 
ccaccgccat 
gaat tggtgc 
atgaggccag 
cc tgtgtgat 
gagat t tgtt 
ctgcggtt tt 
ctttttttgt 
tatgatgaat 
aatgaact ta 
tgaccagccc 
atgaatatat 
ctaaataata 
aaaat tgcaa 
gt 1 1 1 tcacg 
atttgeaege 
tttttatttg 
tc tct t taac 
aggcaaagca 
at taaatt ta 
aggaat tcca 
tc teaegtag 
agtttaattt 
gcaaacgcct 
cege tggega 
aaaceggtat 
atctctgtgc 
tcttctactt 
acctgcgcac 
cccagtacgc 
tegcegtaaa 
ctggcaatgg 
gcaatatcga 
tgatcattag 
ctgcgggcag 
acttegtcat 
tggttttccc 
aaatccgcgc 
tgttgttgca 
ctgt taagee 
gcttccgcct 
ccgatact ta 



gcagcaagac 

ctctctcccg 

t tgtggcaag 

gaeatcgegt 

atcctgcagc 

attcatgttg 

t ttcgtcggg 

aatcagctgg 

t tcccagaac 

gcgccagaac 

tggeggaege 

ctgttcagca 

ggcaacagcg 

ettgegcata 

gcaaaaacca 

tgtgatgaaa 

ataaaaaatg 

cctcggcaaa 

atgatacagc 

cctaccctac 

catatgtgaa 

aagatatctc 

aagat 1 1 ttg 

catttattga 

tttgctgttt 

atat tggtaa 

attagaattt 

tat tcttgag 

ataatggtta 

gataat tggg 

aacaaeggea 

cactgcaggc 

ttgccacacg 

tgagatagac 

aactgaacat 

ttaatacctt 

ccgccagcca 

caaaat tegg 

ggccagcggc 

gggagaaaat 

cgegaatage 

ggaatgggat 

tgagategge 

gtaatgtttt 

accaggggta 

aagtaggatc 

ccactttcaa 

gtaccggatg 

cttccatcgg 

ccgccacggc 

aattcacttt 



agegaaagga 

ctctgtcagt 

ggtttgtagg 

gcggcggtaa 

cat tgegat t 

gtggcaggaa 

ctacgaccag 

cgagcgacct 

acaggatttt 

cccggactgg 

cc tgegagge 

agcctggcaa 

gaactgt t ta 

tttttcccta 

tctgaccata 

agcaggatga 

tttattactc 

tgagcgaaat 

at tacaaaaa 

aatggtgat t 

tagcaacaaa 

ttatgccaaa 

tagacctgga 

tttagttaac 

tgecgaggag 

tcaatataag 

aacgatacta 

gatatagaaa 

gatgtgcaga 

ataaacttaa 

tcaaaaaaat 

tgggtaactc 

ttgtacattt 

accaaat tct 

gccgcgctga 

caccaattcc 

gctggctttc 

egggctggag 

tteggcatet 

tttcgagaac 

gtaggcatcc 

aagctcgegg 

acccgttggg 

cagcgtcgcc 

agtactcact 

gctgacccag 

tgcccctgag 

gtccgcacca 

taaataaagc 

tttcaattgt 

gtegctgega 



taaaggtgtg 
attctgaat t 
tagagttgga 
tategttate 
ttgtattttg 
tatgaaaat t 
tcacaaagaa 
cttttggaat 
tcagataatc 
aaaaaagtac 
tattttcaat 
cgt tagtccc 
gagegaacaa 
aatagtcagt 
acgccagctg 
ttatatttaa 
gatgtaagga 
ttgatagaac 
caatcgaagt 
aattttatta 
acgacatct t 
gatttctttt 
gaagaagtat 
aaaatcaggg 
gt taaaggtg 
gttaaactta 
taattgaatc 
tattctatcg 
aat tattact 
caataaaaga 
catctacaaa 
cagcctgct t 
geegtat tta 
teaegtagtc 
ttaagcagat 
tgacgcattg 
aatgcctcgt 
tagttgegge 
tcacacagaa 
gaattgetea 
tcttccatac 
gctttgagaa 
ttgtggcaac 
aacaggtcat 
tcgaatccag 
aegectgat t 
ccgccaaggg 
aacagcagcg 
gaagegecat 
ggaataattc 
gggtcttct t 



at taaaaaaa 
gacgatcacc 
ggegegcaga 
ggaa tegeca 
c c c t gg 1 1 1 a 
atccgccagc 
gatcgegtea 
gctgaat tea 
ttegctttet 
cgtgtcatcg 
t tgtgcgacc 
agggt taagc 
taagcaaacg 
atgt tgaaac 
ccggggaaag 
cccagtaaat 
taca ttaatt 
atttatctgc 
t tataaagat 
etcgegatgg 
tatc tgttga 
tatacataga 
caagttctat 
gtaagaaaga 
at tattctca 
cacaaagegg 
tcagaaagtc 
taacaaaccc 
ccacacgaaa 
ac tgacaacg 
gaaatagtga 
tectgea t ta 
acccggcgac 
ggtcaacctg 
aatcgaaatt 
ecagaatgeg 
cattcagcac 
gaac tgttgc 
cagaaagtcc 
ecagageggg 
eggcactaaa 
1 1 tcaatcac 
atggatgcag 
taaagegcac 
ccccggcgaa 
ccgggaagta 
tttgaatggt 
gcgcaatggc 
ggggctgege 
cgtcttcgtt 
taaaacgetc 



cagcattegg 
tcttcaccaa 
atgeggatae 
taaaaaattc 
tcgcccgcaa 
gtttt tgaaa 
ccgcggcgta 
tcccagccat 
ggegagaagg 
atatcaaacc 
gaaacccaat 
ggtgaaggag 
gcactgat tg 
tttttattgt 
gaaggat tat 
aegt tcctgt 
ttacgtttat 
tgtacgattt 
gatttctgat 
agatgatgat 
getagatgea 
aacacagt ta 
tgcaagtaaa 
ctttatgtat 
t tacaggacc 
aaaaaagagt 
acaggcaata 
gatacttatt 
aaatagggaa 
ttttattatc 
aagegacgae 
catcaccgca 
acacatgega 
ageggcaett 
gcgttctggc 
agtaegcate 
tgcagccacc 
tttcaat tgc 
gccgacgcgc 
taatccagca 
tccttgatag 
cgcatcccac 
caacacaa ta 
gctgt tagtc 
tat tgctacg 
gcgtt tcagg 
tgctacgcgc 
atggcgatag 
attcaggege 
gtagtacaga 
cataagegta 
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3061 agaatcgggt cgccagcgta ggcgtcaact 

3121 acgggcaggt ggttaaaaca caataaaccg 

3181 gagcgaacgg caattagcaa cagagtgaga 

3241 cgacctttat aagagcatga aagaacatca 

3301 ggataaggca ttcacgccgc atccggcaca 

3361 acgtatttca tcgcgaccct tgaagtcagg 

3421 gtcatttcag cgatacgttc tacgggcaaa 

3481 gctttgtcct gcgcctgtgg acctaagtct 

3541 acaatcggta cttcgcgacc gttcaccagc 

3601 taaccatcgc catagcccat cgcgactacg 

3661 gttccaccat aaccaacagg ctctccggct 

3721 gttagtgaca tcactggctg acagccaaaa 

3781 gagacgccat aaagaatgat gcccgggcgc 

3841 agaatgccac ccgacgcggc aatggaacgt 

3901 aagatagcga gttgtttctc ggttgcgcca 

3961 ctgacgatat tcaccggctg acgaacgttt 

4021 tcagcctgtt ccggccttac gcccagacgg 

4081 gtaaccggct cgtccaagct agcctcttcc 

4141 gcggtatgaa aatgttgcgc agaaatcgtc 

4201 agtaacagta caggtttggt gattcccccc 

4261 gctacgccaa aggcgtcagc atcggggagc 

4321 taagcgttcg ctttcaccac cgcaaccatt 

4381 tgcaggttgt gtcgcagagc gcggcggtta 

4441 ccttgataat tgtttgcttt aattacctaa 

4501 ctgcggcccc gcatagttgt cgaagcgcga 

4561 cccgattggg ccgttacgtt gtttaccgat 

4621 gttttcgtga tacacctcat cacgatagat 

4681 agagccagat tcacgcaggt cggagttgac 

4741 acggttcaac tgggacagcg ccaccaccgg 

4801 cgagcgggag atttctgcaa tttccagcgt 

4861 cagttgcagg tagtcgatca tgataagccc 

4921 tgcgcgggaa cgcacttccg ttggcgtcag 

4981 tttttcgagc aaaataccca tggtgccgga 

5041 cccggtacgg attttagtct ggtcaacgcg 

5101 ctgttctgat ggcatctcca gcgagaagat 

5161 gttttcgacg aggttcatcg caaatgttgt 

5221 gatcaaatcc gacggctgca agccagcggt 

5281 taccccggta acgccatcgt gtggctgctg 

5341 gtcgagcaca tcggcgatgt tcttcggccc 

5401 tttaaagacg cgggattcag caaggtccag 

5461 aaaaccggct tcagcaatct cattcgcaac 

5521 acgcacgatg tccgcataag cactgatgtt 

5581 tgccagataa gcaaaaccac cgacgctatc 

5641 aagggtaatc agatcgatag gactaccgct 

5701 atgacggtgt gggcgggtgt aaaaatcgtc 

5761 gcgttcgtta tccagcatta aaccgcccaa 

5821 aggcactttc agcccggcaa cttgtggatc 

5881 gaagggttta tttcctgcca tagtgaatgg 

5941 ttaccatatg aagcagaccc tgacgatacg 

6001 atttcacaag cacggtggcc cggaagtact 

6061 ggcggagaat gaaatccagg tcgaaaataa 

6121 tatccgcagc ggcctttacc cgccgccatc 

6181 aggcatcgtg agtaaagtcg gcagtggtgt 

6241 ctatgcgcag tcggcgttag gcgcttacag 

6301 ggcgattctg cctgcggcaa tttcttttga 

6361 aacggtttat tatctgctgc gcaaaaccta 

6421 ccacgcagcg gctggcggca ttggcttaat 

6481 gaaactcatc ggcaccgtag gaaccgcgca 

6541 gtggcaggtt attaactatc gtgaagagaa 

6601 cggtaagaaa gtgcgcgtag tgtacgattc 

6661 ggattgcctg caacgccgcg gcttaatggt 

6721 cggtgtgaac ttaggcattc tcaatcaaaa 

6781 gcaaggctat atcaccacgc gggaggaatt 

6841 gattgccagc ggtgtgatta aggtcgatgt 

6901 tgcgcagcgt gcgcatgaga ttctggaaag 

6961 tccataaaag aaatagggct tccacctggg 

7021 gtagggtaca gcacgatgaa tctgttagag 

7081 cctattttgt tctaaggata aaaccttagg 

7141 gccagcggtc ataacgctga tatttcggca 

7201 acaccaccgc aatcgccagt agtaaccacg 

7261 cgccgaggaa cataatggcc gtcgccacaa 



ttttgaaaca cgcgatggtt ctccaggttt 
gaagaaggcg aagatcgagt ggatgttcag 
ctaatgataa acgtacatcc gccagaggca 
acttattgaa tttttaggat tttattggcc 
gacaatcaaa tattacagaa cgattaatcc 
cgcgtaataa gttcgtaagc gcttactttc 
ccttcgcccc ataaaatgac cggatccccg 
acgcagatca tatccatcgc cacgcgcccg 
actggcgtac cggacggcgc ggcgcgcgga 
ccaagacggg tatcacgttc gcttacccag 
ttatgctcac gcacggcaat caggctggag 
tcggcaccgg tggagcgatc ttccagcggc 
acccagtcaa aatgcgactg tggccacagc 
tgaccaggtt tgccttcgca aaaggtatta 
cattttggtt catccgcgcg cgcaaaatgg 
ttgcactggg tcaggcgatg ataaaacgcc 
tgcataccgg tatcgagttt catccagacg 
agcgcagcca gctgttcttc gttatgcacg 
ggcagatctc tggcatcaaa aaagccttcg 
gcacgcagtc gcagagcttc ttcgagacgg 
gttcgcgcgg tctcaagaag accgtgacca 
ttactggcag gtgccagttc acgaagacgt 
atcacaacag ttgccgcttg catttgtgtt 
ttcataaaat aattattatt cgtcgtcgta 
ccattgaccg ttaaaggtca ggcgtaccgt 
aataatttcc gcgatgcctt ttaaatcact 
aaacatgatc aagtccgcat cctgctcgat 
cggacgtttg tcggcacgtt gttccagaga 
cacgttcagt tctttcgcca gtgctttcag 
acggttatcg gaaagcgccg gtacgcgcat 
gatgccgccg tgttcacggg caatacggcg 
gccggaagag tcatcgatat agatattgcg 
aatgcgcgcc cagtcttcat catcgagctg 
cgacagcgac gccagagaac gcatcatgat 
aagtactggt ttatcctgca acattgccgc 
tttacccatc gacggacgcg cggcgacgat 
ttttttgttg agatcgtcat aaccggtgtt 
aaacaactgc tcaatacgcg ccacggttgc 
ttcgtctttg tttgcacgac tttcggcaat 
cagatcttcg ctggtacgcc cctgcggatc 
cgagatcatc tcacggacaa cggcacgttc 
cgccgcactt ggcgtatttt ttgacagctc 
gagttgcccc tggcgttcca acgattccgc 
ttcctgcaaa cgcgccattt cagtaaagat 
tgctaccaca cgctcggcta catcatccca 
caccgactgc tccgcttcga tcgagtgcgg 
gcgttcgcgg ggttcagcct gctgtttgtt 
agttaccgaa ataaagaatg ggtcgaaact 
ttctggagga cacatggcaa cacgaattga 
tcaagccgta gagttcactc ctgccgatcc 
agccatcggc atcaatttta tcgacacgta 
gctacccagc ggattaggta ccgaagcggc 
aaagcatatt aaggctggcg atcatgtagt 
ctctgtgcat aacattaatg cggataaagc 
gcaagctgcg gcatccttcc tgaaaggctt 
tgaaattaaa cccgatgaac agttcctgtt 
tgcctgccag tgggcaaaag ccctgggcgc 
aaaagcgcag agcgcgctaa aagccggcgc 
tctggtcgag cgattgaaag agatcaccgg 
cgtgggcaga gacacctggg aacggtcgct 
cagttttggc aactcatcag gtgcggttac 
aggctcgttg tatgtgacac gcccttccct 
aaccgaggcc agtaatgaac tgttctcttt 
cgccgagcag cagaaatatc cgctgaagga 
ccgggcgacg caaggttcca gcctgctgat 
aagccctttc tttttatagt tcggctgtat 
gcgcaatagt gacagatttg attatcaatt 
ttgtgatcat ccgcacaatc ccttagtaac 
tttttggtgc tttaatcgcc ttaataaccc 
gcagcaactt aatcatcaat gccagcatac 
ccagcgcggc gataatgccc agcaacgaaa 
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7321 cgccggtgac catcagcatg acaaaaaagc caatcacaaa aagtagttcc agcatgatgc 

7381 tctcccaaat atgaaatctc ttgctggcat tacaagaatc atgccaaaaa taatctattg 

7441 atttaacagc aaaacgcccc gcgacggtgc gcagggcgtg gtgaatttga ctactttttg 

7501 gtgaaaagtt aacgcttatc cgccaccagt ttgagcgcgt gttccagcac gttaatgtca 

7561 gcacccgctt tatgggcatt ttcacttaaa taacgccgcc actgccgcgc gccaggaata 

7621 ccctggaaca aacccagcat atgccggata atatggccga gatacgtccc ctggctgagt 

7681 tcatgctcaa tgtacggata catggcacgc actactgcca ccggatcggc atcggtatcc 

7741 gaggaaccaa agatctcccg gtctaccgcc gccagaatac ccggattctg atacgcctcg 

7801 cgcccgacca tcacgccatc catatgttgc aaatgcgctt tagcttcttc cagcgacttg 

7861 ataccaccgt taatcgacat tgtcagatgc gggaagtcac gcttcagttg atacacacgc 

7921 ggataatcga gcggcgggat ttcacggttt tctttcgggc ttaaccccga aagccaggct 

7981 ttacgtgcgt ggatgatgaa catctcacac tcgcctttgc cggaaacggt attgatgaaa 

8041 tcgcagagaa attcatagct gtcctggtcg tcgatgccaa tacgcgtttt caccgtcacc 

8101 ggaatcgaca ccacatcgcg catcgctttc acgcagtcgg caaccagctg cgcattaccc 

8161 atcagacacg caccaaacat gccgttctgc acccggtcag acgggcagcc gacattcagg 

8221 ttgatctcat catatccgcg cgcttctgcc agctttgcac actgcgccag cgccgccgga 

8281 tcgctaccgc ccagttgcaa cgctaccgga tgttcttctt cactgtacgc cagataatca 

8341 cctttaccgt gaattatcgc ccctgtggtc accatttcgg tataaagcaa cgtattgcgg 

8401 gaaagcagac gcaagaaata gcggcaatgt ctgtccgtcc agtcgagcat tggtgcaaca 

8461 ctaaaccgac cactccagtg aacgtcagtt ttttcaggca ttgcgctggt ttggttgatt 

8521 ttttgcattt cagaattacc gtgcattttc aaatgtagag attattttat cgatatatca 

8581 tggggttatg ttattcagct tcactgttca ggaggctcaa tagcggggta ctataccata 

8641 acaacaggaa gcgcctgtct cattgcaaaa gaaaattgag atcctctcaa ggcatgaagc 

8701 tctcacgaag tgatggaaat aatcttatta gccgttaact ttgttaaggc caatgataaa 

8761 caatccaggt tcgacgataa ataaaaaatc acacattaaa ctctggtgat atatctccct 

8821 gctaatagca ttgatagaga aaaaagaacc caataaatat tgggtccttt tatataatgc 

8881 cttccatact atcgaagaac ttcacatatt atttctccga tttaaccccg aacaagtcat 

8941 aaattaattt agaagtgtct gtaagtattt taatctcttc ctttgaggtt gggtcaaacg 

9001 acttcgcgaa gctaattaat tgtggtgcag catctcgcat tttgcttaaa atatcaggat 

9061 cgagcgttcc ttcattcacc agatgtgaca tcttatccag atagccatca aaattcattt 

9121 ctcccccatc tgccagacgc ttcatctctc ctaggtactt cttcatatca cgttgatata 

9181 atttttcaaa ctgagctttc aggtagctct cattattttc attaatatac actgtttctg 

9241 ataaatctgc gtaggcacta gaatatgaaa gtgaaattaa cagcgaagca aataatttta 

9301 agctgttgtc atcatgttcc tggcaggcat taacaaacgt taaaaaacca gaacctattg 

9361 cttgaaaatg tatattcgct aaaggttcat tatccttaga ttcctcataa aatagctgaa 

9421 ccgtggaggg aaggccattc ttagtttcta catcaaaagt acattgttca atgggcattg 

9481 catctttatt atccccagga ttacatatac tattaatcgc gaaatgaaaa atagcccggt 

9541 caacaactga tgcaataaag tcacgatcgt ttaaatcctc atcagtgcaa tcgtcagtat 

9601 acttattgac ccacattttg agatcattta tgtgctcaat taaaaacaca tttttttacg 

9661 tgcatgccat aaattatgcg tgattcgttg aataaacaaa tctatttttg caataattat 

9721 atcccgatga cgcagcgctg cataatcaat agatttacca tatatattat ttaatttacc 

9781 tttgattgaa cctctattat ccaatttatc tttggttgca tctttatttt cgaattgatt 

9841 tgaattgagt gttttattaa agtatccagt tgcaccgggt attatcttca acatgattca 

9901 tctacctttt ttgtcgtacc gtgtcctcct tataattctt ttgccatctg gtcactatca 

9961 aaaatcgctc ataatttctg aactacaggt atgcgtttaa acccagttaa ataaatagcg 

10021 agttaactt 
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